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Abstract 



in 

In the context of Lorentz-invariant massive gravity we show that classical solutions around heavy sources 
are plagued by ghost instabilities. The ghost shows up in the effective field theory at huge distances 
from the source, much bigger than the Vainshtein radius. Its presence is independent of the choice of the 
*^ ■ non-linear terms added to the Fierz-Pauli Lagrangian. At the Vainshtein radius the mass of the ghost 
Qh. is of order of the inverse radius, so that the theory cannot be trusted inside this region, not even at the 
J|h | classical level. 



1 Introduction 



In recent years there has been renewed interest in the possibility of giving a mass to the graviton. This 
idea belongs to a broader class of proposals for modifying gravity at large distances. Besides their 
theoretical interest, these models could be phenomenologically relevant as possible alternatives to dark 
matter and dark energy. In this paper we reconsider the issue of the range of validity of massive gravity; 
in particular we concentrate on the stability of classical solutions around massive sources. 

The problem we want to address has a long history. Already in the first paper Fierz and Pauli 
observed that the mass term must be of the form m^(/i 2 — h^h^"), otherwise a ghost appears in the 
spectrum, besides the 5 degrees of freedom of the massive spin-2 graviton. A different structure would 
result in an instability at an energy scale ~ m g . Unfortunately Boulware and Deser showed that this 
additional degree of freedom propagates when nonlinearities in the action are taken into account |2J. 
However, from an effective field theory point of view this is not necessarily a problem, until one specifies 
the scale at which the ghost shows up, i.e. its mass. If this scale is above the UV cutoff A of the effective 
theory this instability can be consistently disregarded. 
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On the other hand, non-linearities of the classical theory are also the solution to the problem raised by 
van Dam, Veltman, and Zakharov (vDVZ) [Hill]: in the linearized theory predictions are not continuous 
in the limit m g — > 0, because the helicity-0 component of the graviton does not decouple from matter. 
However, Vainshtein [2] observed that the vDVZ discontinuity might not be relevant for macroscopic 
sources because the linearized approximation around a source of mass M* breaks down at a distance 
Ry = (M*Mp 2 m~ 4: ) l l^ which diverges for m g — > 0. Classical nonlinearities become important, and the 
full non-linear solution could be in perfect agreement with experiments. This is still an open issue in 
massive gravity, but the Vainshtein effect has been shown to work in a closely related model, the DGP 
model 00 IB]- 

More recently massive gravity has been reconsidered in the effective field theory language, which 
provides a systematic framework for dealing with quantum effects [2]. For this purpose it is useful to 
restore the broken diffeomorphism invariance by introducing a set of Goldstone bosons. With this method 
it is easy to see that the scalar longitudinal component of the graviton becomes strongly coupled at a 
very low energy scale, much lower than what naively expected by analogy with the spin-1 case. In the 
Fierz-Pauli theory the strong interaction scale is A5 ~ (m^Mp) 1 / 5 , which for m g of order of the present 
Hubble parameter is (10 11 km) -1 . By adding to the Fierz-Pauli Lagrangian a set of properly tuned 
interactions of the form h™ u the cutoff can be raised up to A3 ~ (m g Mp) 1 / 3 ~ (1000 km)" 1 0HU]. 

In both cases the theory seems to lose predictivity at very large distances. For instance one can 
wonder how this strong coupling affects the gravitational potential generated by an astrophysical source. 
Apparently the potential is uncalculable at distances smaller than 1000 km; but in principle this could not 
be the case. After all the strong coupling takes place in the Goldstone sector, and inside the Vainshtein 
radius, if the Vainshtein effect applies, one expects the Goldstone to give a negligible correction to 
the Newtonian potential. Whatever quantum effects take place at the cutoff distance, they could be 
sufficiently screened from experiments. Nevertheless, without further assumptions, from an effective 
theory point of view one should include in the Lagrangian all the possible operators allowed by the 
symmetries and weighted by the cutoff. In this case the effective theory loses predictivity at a much 
larger length scale: these higher dimension operators all become important at a huge distance from the 
source when evaluated on the classical solution. In the improved theory with cutoff A3 this happens 
at the corresponding Vainshtein radius Ry ~ (M*Mp 2 m~ 2 ) 1 ^ . For the sun Ry is ~ 10 16 km. This 
means that we are unable to compute the gravitational potential at distances shorter than Ry without a 
UV completion. As a consequence there is no range of distances where nonlinear effects can be reliably 
computed in the effective field theory. If we restrict to the original theory with cutoff A5 the situation is 
even worse. In this case the infinite tower of higher dimension operators become important at a distance 
which is parametrically larger than the corresponding Vainshtein radius [S]. 

The picture looks very similar to the DGP model where the same problems have been pointed out 
|12| . In the DGP model our world is the 4D boundary of an infinite 5D spacetime. Gravity is described 
by a standard 5D Einstein-Hilbert action with Planck mass M5 in the bulk and by an additional 4D 
Einstein-Hilbert action localized on the boundary, with a much larger Planck mass M4. The resulting 
Newton's law is 4-dimensional below the critical length scale Ldgp = Mj/Mf and 5-dimensional at larger 
distances. From the 4D viewpoint there is a scalar degree of freedom, the brane bending mode tt, whose 
dynamics is closely related to that of the longitudinal Goldstone boson eft of massive gravity. In particular 
strong interactions show up in the tt sector at a tiny energy scale Adgp ~ (^p/^dgp) 1 ^ 3 ~ (1000 km) -1 
(taking Ldgp of order of the present Hubble horizon Hq 1 ). Also if one includes in the Lagrangian all 
possible operators allowed by the symmetries and suppressed by Adgp> around a heavy source they all 
become important at the Vainshtein radius Ry ~ (M*Mp 2 Lp G p) 1//3 when evaluated on the classical 
solution. 
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However all these difficulties depend on assumptions about the UV completion. In the DGP model 
one can consistently assume a UV completion such that the effective theory is predictive down to distances 
significantly shorter than 1 / Adgp • F° r instance on the surface of the earth the cutoff can be pushed up 
to ~ cm^ 1 , not far from the smallest length scale ~ 100 /im at which gravity has been experimentally 
tested. Of course a necessary requirement for this to be possible is the consistency of the classical theory: 
in particular in DGP no classical instability develops in the tt sector for all relevant astrophysical sources 
and for a large class of cosmological solutions [Ej- in this paper we want to study whether the same 
stability properties hold for massive gravity. This is a basic consistency requirement one has to satisfy 
before analyzing the theory at the quantum level and looking for a mechanism, analogous to that working 
in DGP, that can make the theory predictive in a phenomenologically interesting range of scales. 

The most convenient way to study the dynamics of the theory is to use the Goldstone formalism, that 
we review in section |3 for our purposes the main interesting features of the model are encoded in the 
Lagrangian of the longitudinal component cp of the Goldstone vector. If we start with a Fierz-Pauli mass 
term, the dominant interactions for this scalar degree of freedom are cubic self-couplings with 6 derivatives 
of the form (d 2 <ft) 5 . In the presence of a macroscopic source the Goldstone gets a non-trivial configuration 
$(x); in order to study the stability of such a solution it is necessary to expand the action at quadratic 
order in the fluctuations around <I>(x). It is evident that in general, because of the cubic self-coupling, the 
fluctuations will get a higher-derivative kinetic term. As we discuss in section this signals the presence 
of a ghost-like instability already at the classical level. In DGP this does not happen: although the ir 
cubic self-coupling has 4 derivatives, its tensorial structure is such that fluctuations around a background 
get only a 2-derivative kinetic term j!21 113j . Unfortunately, this does not work for the Fierz-Pauli theory. 
Still, we have a large freedom in choosing the non-linear extension of FP, and one can wonder if it is 
possible to cancel all higher derivatives terms and end up with a ghost-free theory. Sections 0] and [S] 
contain the answer: despite the freedom we have, ghost-like instabilities are unavoidable. 

In the Goldstone language it is also easy to compute the scale at which this instability appears. Even in 
the most favorable setup in which the cutoff is A3 , the ghost enters in the effective field theory at distances 
from the source parametrically larger than the (already huge) Vainshtein radius Ry. Furthermore, when 
in approaching the source we reach r = Ry the mass of the ghost has dropped to 1/Ryl This means 
that in no way the theory can be extrapolated inside the Vainshtein radius. 

The last part of the paper is devoted to discuss how the sickness of the theory is interpreted in the 
unitary gauge. Clearly we expect the ghost we found to be the troublesome sixth degree of freedom. 
With the Fierz-Pauli mass term, at quadratic level this degree of freedom does not appear because the 
trace of the Einstein equations gives a constraint instead of a propagating equation. In section El we show 
that this equation becomes dynamical in the presence of a curved background; we qualitatively estimate 
in this simple case the mass of this new excitation and the result agrees with the mass we find for the 
ghost in the Goldstone computation. Then, in section [7[ we show in the Hamiltonian formalism that 
there exists no non-linear extension of the Fierz-Pauli theory that can forbid the propagation of the sixth 
mode in the presence of a slightly curved background. The analysis in the unitary gauge is powerful for 
counting the number of degrees of freedom, but, unlike the Goldstone analysis, it says nothing about the 
typical scales of these modes. Also in order to address stability issues one should study the positivity 
of the Hamiltonian. This in general is difficult, and the analysis has been carried out by Boulware and 
Deser for non-linear extensions of the form j(h ilv h iLU — h 2 ) [2]. On the contrary the Goldstone analysis 
concentrates from the very beginning on the strong interacting degree of freedom: all the interesting and 
troublesome features of the theory are encoded in the dynamics of a single scalar field. This enormously 
simplifies the analysis. 

Recently it has been realized that massive gravity models with Lorentz violating mass terms can be 
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significantly 'healthier' than the traditional Lorentz-invariant theory; it particular they can avoid the 
vDVZ discontinuity and the strong coupling problem, and they can be free of ghosts |14 |ll5llT6| . In this 
paper we stick to the Lorentz-invariant massive gravity theory. 

2 Ghosts from higher derivative kinetic terms 

Let us first be very specific about why higher derivative kinetic terms give rise to ghost-like instabilities. 
Take for instance a massless scalar field <f> with Lagrangian density (note that we are using the (— , +, +, +) 
signature!) 

£ = -i(^) 2 + ^(D0) 2 -Mnt(0), (1) 

where A is some energy scale, a = ±1, and V- m t is a self-interaction term. We show that, independently 
of the sign of the second term, the system is plagued by ghosts. To do so we want to reduce to a purely 
two-derivative kinetic Lagrangian, from which we know how to read the stability properties of the system. 
We therefore introduce an auxiliary scalar field x an d a new Lagrangian 

C! = -\{d<\>f - a dyx&t ~ h A ' X 2 - Vtotf) , (2) 

which reduces exactly to C once x 1S integrated out. CJ is diagonalized by the substitution <p = <p' — ax- 
We get 

C! = -\{d<j>'f + \{d X f - \a A 2 x 2 ~ Vtotf, x) , (3) 

which clearly signals the presence of a ghost: x has a wrong-sign kinetic term. Notice in passing that x 
can also be a tachyon, for a = — 1: in this case x has exponentially growing modes. But let us neglect 
this possibility and concentrate on the ghost instability, which is unavoidable. A ghost, unlike a tachyon, 
is not unstable by itself: its equation of motion is perfectly healthy at the linear level, and does not 
admit any exponentially growing solution. The problem is that its Hamiltonian is negative, so that 
when couplings to ordinary 'healthy' matter are taken into account (the potential term in our example 
above) the system is unstable: with zero net energy one can indiscriminately excite both sectors, and this 
exchange of energy happens spontaneously already at classical level. In a quantum system with ghosts in 
the physical spectrum this translates into an instability of the vacuum. The decay rate is UV divergent 
due to an infinite degeneracy of the final state phase space. It is not clear how to cutoff this divergence 
in a Lorentz invariant way |17j . 

However the situation is not as bad as it seems: our ghost x i n ec L- © has a (normal or tachyonic) 
mass A, so that it will show up only at energies above A, i.e. when the four derivative kinetic term in 
eq. © starts dominating over the usual two derivative one. We can consistently use our scalar field 
theory eq. © at energies below A, and postulate that some new degree of freedom enters at A and takes 
care of the ghost instability. For example, we can add a term — (dx) 2 to eq. © (for simplicity we stick 
to the non-tachyonic case a = +1 and set V^t = 0)) 

Ajv = -\{d<t>? ~ drfW ~ (d X ) 2 - l 2 A2 X 2 - (4) 

This drastically changes the high-energy picture, since the resulting Lagrangian obtained by demixing 
now describes two perfectly healthy scalars, one massless and the other with mass A. At the same time, 
at energies below A the heavy field x can be integrated out from £\jy, thus giving the starting Lagrangian 
eq. © up to terms suppressed by additional powers of (<9/A) 2 . This example shows that in principle the 
ghost instability can be cured by proper new physics at the scale A. In other words, eq. © makes perfect 
sense as an effective field theory with UV cutoff A. 
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3 The Goldstone action 



In this section we briefly re-derive the Lagrangian of massive gravity along the lines of i.e. keeping 
explicit the Goldstone bosons of broken diffeomorphism invariance. 

To write down a mass term for gravity, in addition to the full dynamical metric g^ u , we have to take 
a reference fixed metric: for our purposes we will take the Minkowski metric r]^ u . A mass term breaks 
invariance under general coordinate transformations. However, as it has been shown in Ref. [S], one 
can always restore local coordinate invariance by the Stiickelberg trick: in analogy with massive gauge 
theories one introduces a set of Goldstone fields and requires that they transform non-linearly under a 
local coordinate transformation. The fundamental object to be used for this purpose is a symmetric 
tensor H a p, built in terms of the reference metric, the field describing metric fluctuations = g^ u — rj^ 
and the four Goldstone fields 7r M , 

H a p = Kf3 + daTT/3 + dpir a + da^dpTT-f . (5) 

H^ u transforms as a covariant tensor under local diffeomorphisms x a — > x a + £ Q provided that n a shifts, 
n a — * iT a — £ a . As in non-abelian massive gauge theories, since now local coordinate invariance is non 
linearly realized on the tt field, a Lagrangian built using H will be valid as an effective theory and its 
breakdown will appear as the Goldstone sector becoming strongly coupled at some scale A. This indeed 
has been shown in Ref. jOj. It is useful to further split the 4 7r Q fields into a vector and a scalar as 

ti> = + (6) 

together with an additional hidden U (1) gauge invariance for under which shifts. Note that since eft is 
a Goldstone boson under this £7(1) gauge symmetry and 7r M is a Goldstone boson of broken diffeomorphism 
invariance (j) wm appear with two derivatives in the Lagrangian, as evident from eq. ijSjl. A mass term 
for h^ u can be written down in terms of H^ v as 

y/^ggT^iaH^H^ + bH^H a p) . (7) 

Expanding H using (jHJ) one easily realizes that for a generic choice of a, b there is a quadratic term 
in (p containing 4 derivatives. This term signals the presence of a ghost as we have seen in Section |2 
Only for the Fierz-Pauli choice a = —b = m^Mp this four derivative term exactly cancels. In this case (j) 
does not have a kinetic term on its own, but only a kinetic mixing with h^: m^Mpid^dyfyh^ — \3(j)h). 
A conformal rescaling of the metric = + m^r]^^ diagonalizes this mixing and generates a small 
(i.e. proportional to rrig) kinetic term for <p besides interactions of the form (f)(d 2 (j)) n . The smallness of 
the kinetic term is the origin of the low strong coupling scale as it enhances the <f> interactions once the 
fields are canonically normalized. 

One can easily see that the most relevant interactions are of the form 



Mpmi 



m 2 Ml(d 2 ^f = (8) 



where <p c is the canonically normalized field. These interactions saturate perturbation theory at the tiny 
energy scale E ~ A5 = (rrt^Mp) 1 ^ . 

One can slightly improve the situation canceling these cubic interactions by adding H 3 terms. Now 
the most relevant interactions will be of the form (<9 2 </>) 4 and this procedure can be repeated at any 
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order. The dominant interactions will be (d 2 (j)) n and once all these are canceled the theory has the cutoff 
A3 = (ra^Mp) 1 / 3 . In fact after this procedure the most relevant interactions are 

m 2 g M 2 P {dAf{d 2 <P) n and m 2 g M 2 ( \ v + m 2 ,^)^ 2 <\>) n , (9) 

which are weighted by A3 when expressed in terms of canonically normalized fields. 

The interaction between matter and gravity is as usual described by the term ^h a pT a ^. The Weyl 
transformation that demixes hav from (j) thus generates a direct coupling of (f> to the trace T of the 
stress-energy tensor. This implies that we will have a non-trivial (j) background around any astrophysical 
source. For a classical solution we will have two relevant scales: the first is the Schwarzschild radius 
Rs, the distance from the source at which linearized gravity breaks down, the second is the Vainshtein 
radius Ry [5] where nonlinearities for the scalar field (j) become important. For a source of mass M* this 
distance is equal to I/A5 (M*/Mp) 1//5 (which is much larger than Rs)- At this scale the term (<9 2 </>) 3 
becomes as relevant as the kinetic term whereas all the other nonlinear terms are important only when 
we reach the Schwarzschild radius so that they can be safely neglected. Therefore the action of (j) in the 
presence of sources is given by 




the structure of the trilinear terms can be changed by adding non-linear interactions to the Fierz-Pauli 
Lagrangian. 

The situation remains qualitatively unchanged even if the first N (d 2 cj)) n interactions are tuned to 
zero. The first non- vanishing term, (d 2 (j)) N+1 , will set the Vainshtein radius, while higher order terms 
will become relevant again at the Schwarzschild radius. 



4 Massive gravity in the presence of a source 

Let us consider the Lagrangian eq. (|l(Jj) in the presence of a macroscopic source, like the Sun. This 
induces a classical background <&(x), solution of the eft equation of motion. To study the stability of this 
solution we expand the Lagrangian at the quadratic order in the fluctuation cp = eft — <£. The result is 
schematically of the form 



~ V ^ > ^ A_5 



i.e. the background gives a four-derivative contribution to the ip kinetic term. As discussed in sect. |2J 
this results in the appearance of a ghost with an x-dependent mass 



A5 5 

<9 2 $ c (3 



^ghost(^) ~ t^TTZT • ( 12 ) 



Remember that we are dealing with an effective theory with a tiny UV cutoff A5, therefore we should 
not worry until the mass of the ghost drops below A5. In approaching the source from far away, this 
happens at a distance i? g host from the source such that d 2 & c ~ A5 3 . Unfortunately this is a huge distance, 
parametrically larger than the (already huge) Vainshtein radius Ry- In fact for a source of mass M* at 
distances r 3> Ry the background field goes as $ c (r) ~ (M*/Mp) • 1/r, so that 

^ host ~t 5 {m^) >y R y ~ t 5 Up ) ■ (13) 
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Therefore the ghost is going to show up in an extremely weak background field, when the latter is still 
in its linear regime. 

Inside -Rghost, in the spirit of sect. EJ one is forced to postulate that additional physics lighter than 
the local ghost mass cures the instability, that is the cutoff must be lowered from A5 to mg^ os t(x). A 
byproduct of this in general would be that interactions strengthen, being weighted by the new cutoff scale 
rather than by A5. But let us optimistically assume that, instead, the only effect of this new physics is to 
cure the ghost instability. However, when the local ghost mass is of order of the inverse distance from the 
source there is no way of proceeding further without specifying the UV completion of the theory, since 
the background itself has a typical length scale of order of the UV cutoff. One can easily check that this 
happens at the Vainshtein radius Ry. There is no sense in which one can trust the classical solution below 
Ry. Since one can hope to recover General Relativity only in the region inside Ry, where non-linear 
effects can hide the scalar (Vainshtein effect), this also means that General Relativity is nowhere a good 
approximation. 

Notice that in the DGP model the dominant interaction of the Goldstone has the form □7r(9-7r) 2 , 
which suggests the same problem we are facing, as there are two derivatives acting on one of the 7r's. 
Nevertheless, in the equation of motion terms with more than two derivatives acting on a single field 
cancel out and one is left with a (non-linear) second order differential equation [T^] 1 . The same cannot 
happen in our case since there are too many derivatives: the contribution of the trilinear term to the 
equation of motion is a sum of terms with 2 (j>'s and 6 derivatives. In any term there is at least one (f> 
carrying more than two derivatives. 

One is thus led to consider the possibility of eliminating the unwanted trilinear interaction of the 
Goldstone by adding appropriate cubic terms in H^ u to the Fierz-Pauli Lagrangian eq. 0. The three 
independent contractions are if 3 , if (if^) 2 , and (if^) 3 , where the last stands for the cyclic contraction 
of the indices. These contain interaction terms for the Goldstone of the form (<9 2 <^>) 3 which, for the proper 
choice of coefficients, cancel the trilinear interaction of eq. (jlUj) . However, in this way one introduces 
further quartic interactions (d 2 ^) 4 on top of those already present in the Fierz-Pauli mass term, because 
of the non- linear relation between if ^ and (j> of eq. © . These are problematic for exactly the same reason 
as before, and the same problem shows up at any order: an interaction term of the form (d 2 (j)) n evaluated 
around a background gives a contribution to the equation of motion for the fluctuations with too many 
derivatives. This signals the presence of a ghost instability 2 . Again one can check that the cutoff (i.e. the 

^quivalently, working at the level of the Lagrangian, one can expand the interaction term □^(chr) 2 to second 
order in the fluctuation tp around a background 7Tb. The worrisome term is Oipd^ip d^Kb, since it has 3 derivatives 
acting on the tp's. But by integration by parts one can shift one derivative from the fluctuations to the background, 
thus obtaining an ordinary 2-derivative kinetic term for the fluctuations (whose positivity must however be checked) 

021 Q3|. 

2 The reader could wonder if there exists a choice of coefficients such that terms with 4 derivatives on a single 
field cancel in the equation of motion. In this case one would be left only with 3-derivative terms and our 
conclusions should be modified. But this is not the case: setting to zero all 4 derivative terms leads also to the 
cancellation of those with 3 derivatives. To see this, consider the most general interaction Lagrangian of rt-th 
order, D- n > — Y ai ^ 1 '" anl3n d ai df3 t 4>- ■ ■ d an d/3 n (j), where L is a tensor constructed with the metric n^. Given the 
structure of contractions, without loss of generality we can choose r ai/3l '" a " /3 " to be symmetric under ai <-> and 
(a>i,/3i) <-> (<Xj,/3j). Then, for symmetry reasons, the contribution of to the <j> equation of motion is 

paifr-anA, (a Qi a /3i a Q2 a (32 0)(a Q3 a (33( / ) ) + B n (dp 1 d a2 dp 2 </>) (d ai d a3 dp 3 (t>)}d ai dp 4 </>• • -d an dp n (j) , (14) 

where A n , B n are combinatoric factors. The four-derivative term (the first in brackets) identically vanishes only if 
the totally symmetric part of r c * l/3l '" Q "' 9 " in the first four indices does, Y^ ai ^ ia2 ^'" an ^ rl = 0. tn this case, given 
the symmetries of L, it is straightforward to check that r ai < fta!ft ) ---^^ = pCai/^ftO-aWSn = q phis eliminates 
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ghost mass) becomes of order of the inverse radius at the new Vainshtein scale (always defined as the 
distance from the source at which non-linearities become relevant). Hence one never recovers General 
Relativity. 

The only possibility is therefore to concentrate on theories in which all the interactions of the form 
(d 2 (p) n are set to zero by properly choosing infinitely many coefficients. We can look at this procedure 
as an extension at non-linear order of the Fierz-Pauli choice for the mass term which, as discussed in 
sect. 01 leads to the cancellation of the (d 2 4>) 2 terms. In the next section we prove that also in this case 
we cannot avoid higher derivative kinetic terms for the fluctuations of the field <j) around a non trivial 
background: ghost-like instabilities are unavoidable. 



5 Ghost instabilities are unavoidable 

The cancellation of the {d 2 (j)) n interactions has also been considered as a way to raise the strong interaction 
scale to A3 = (ra^Mp) 1 / 3 3> A5 [H1E3- In fact, after the cancellation, the leading interactions are of the 
form 

m 2 g M 2 (dA) 2 (d 2 ct>) n and m 2 g M 2 {h^ + m 2 ^)^)" ; (15) 

when the fields are canonically normalized all these terms are suppressed by the scale A3 , while additional 

(3) 

interactions are weighted by higher scales. Correspondingly the Vainshtein radius now shrinks to R v = 
l/A 3 (M,/M P ) 1 /3 ^ R f as the original leading interactions have been canceled. At this radius all non- 
linear terms of the form (h^ v + m 2 rj^ v (j))(d 2 (j)) n become relevant; on the other hand there are no terms 
linear in A in the Lagrangian, so that A is sourced neither by matter nor by the other fields. Interactions 
involving this field are therefore irrelevant for our purposes, and can be consistently neglected. 

The theory we are describing is not unique: there are different possible choices of coefficients that 
cancel all the interactions {d 2 (f) n . We can easily see why. Let us start with the canonical Fierz-Pauli 
mass term 3 £ 2 = {[H 2 ] - [H] 2 ); since H^ u = h^ v + 2d^d u (j) + d^d a 4>d u d a (j) (setting A^ = 0), £ 2 

contains (<9 2 ^) 3 interactions. We can cancel them adding an appropriate combination of terms cubic in 

we can still add, with an arbitrary overall coefficient 

«3, the expression 

CT = V=9~ (m][H 2 ) - [H] 3 - 2[H 3 ]) , (16) 
because it gives (d 2 (p) s terms in the combination 

(□<A) 3 " 3 □<£ (S M cU) 2 + 2 (d^f , (17) 

which is a total derivative (hence the superscript 'TD') and thus does not contribute to the equation of 
motion. Now £2 + £3 + a s^-s D contains (<9 2 0) 4 interactions and we can repeat the procedure. Fourth 
order terms are canceled by 

At = v^d tz [(5 + 24a 3 )[# 4 ] - (1 + 12a 3 )[# 2 ] 2 - (4 + 24a 3 )[#][# 3 ] + l2a 3 [H 2 ][H] 2 ] . (18) 
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the 3 derivative term (the second in brackets) as well, i.e. eliminates the contribution of to the <j> equation of 
motion altogether. 

3 We use the notation [H] — g^H^, [H 2 ] = g^ g a ^ H^ a Ii v p, and its straightforward generalization to higher 
orders. 
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Again we have the possibility to introduce a second arbitrary coefficient, 04, in front of the 'total deriva- 
tive' term 

£j D = ^([H] A -6[H 2 }[H} 2 + 8[H 3 ][H} + 3[H 2 } 2 -6[H' 1 }) , (19) 

and we can go on at higher orders until all self-couplings are removed. Notice that these total derivative 
terms £™ are the higher order analogue of the Fierz-Pauli mass term: they are combinations of 
interactions that reduce to a total derivative when expressed in terms of d^d u (p and therefore do not 
contribute to high-derivative terms in <j). One can check that there is one of such combinations per order, 
but for our purposes we will need them only up to fourth order 4 . 

We now want to see if it is possible to get a kinetic structure without ghosts for the fluctuations 
around a background. Note that this theory has potentially dangerous terms of the form h^ u (d 2 (j)) n . We 
could hope that the large freedom we have in the choice of higher-order terms, parameterized by the 
coefficients 03, 04, . . ., helps us to obtain a ghost-free theory. Unfortunately, this is not the case and the 
main reason is that the number of possible contractions of H^ u grows very fast and soon we cannot cancel 
all the dangerous kinetic terms. Let us see how this works explicitly. We call h b ^ v and (j) b the background 
fields 5 , and we study the quadratic Lagrangian for the scalar fluctuation ip. Actually, instead of working 
at the level of the Lagrangian, the most direct approach is to look at the equations of motion linear in 
the fluctuations, because in this case there is no ambiguity coming from integration by parts. We have 
to check whether all the terms with more than 2 derivatives on ip can cancel in the equations of motion. 

We start with the Lagrangian terms cubic in the fields: they can come only from £2 + £3 + a^C™. 
Only terms schematically of the form h(d 2 (p) 2 can give a ghost; terms with a higher number of hn V have 
no more than 2 derivatives. Using the explicit expression of the Lagrangian, it is immediate to verify 
that i) the terms in the e.o.m. with more than 2 derivatives on ip cancel already with 03 = 0, and ii) 
they cancel also when originating from ^C™ alone. We conclude that at the cubic level in there are 
no higher derivative kinetic terms for ip and the parameter 0:3 can still be varied arbitrarily. 

What happens with the quartic Lagrangian? Let us consider the interactions h^ u (d 2 cp) s ; now we must 
include also £4 (eq. (fl8|0 . and for simplicity we start with 04 = 0. Again it is straightforward to write 
down the equations of motion linear in ip. The terms with more than two derivatives on the fluctuation 
can be divided into two classes: those containing {h b )^ /t and those in which (h b )^ is contracted with 
derivatives of (p. Either class must cancel independently of the other, since, given the different tensor 
structure, there is no possibility of cross-cancellation between the two. In the e.o.m. the terms belonging 
to the first class automatically cancel. They come from a piece of the Lagrangian that is precisely 4a3 [h] 
times the 'total derivative' combination eq. (|17|) . The remaining dangerous kinetic terms, which belong 

4 It is easy to check that at n-th order a total derivative term is given by 

. ..qOnvVn) d ai 8^---d an d p ^ , (20) 

where the sum runs over all permutations n of the (3 indices, and (— l) w is the parity of the permutation. To prove 
that this combination is the only total derivative term at a given order assume that there are two of them. One 
then could construct a total derivative term that does not contain, say, (□</>)". Imposing that the contribution to 
the field equations is zero, it is straightforward to show that also all the other terms vanish. 

5 Remember that the field h^ v is the graviton before the Weyl rescaling that demixes it from <p: = + 
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to the second class, come from the Lagrangian 6 



(8 + 72a 3 ) ([h<j> 3 ] - [h^M) - 36q 3 ([h #</> 2 ] - [h 4>}[4>} 2 ) . (21) 

Can we add now a 4 Cj D (eq. (JUJ))? Yes, since this does not reintroduce terms of the first class (i.e., 
containing (/i) M ^) in the equations of motion. Then, can we choose a 4 to get rid of the contributions 
coming from eq. (|21|l? Unfortunately the answer is negative. In fact, expanding ev 4 £j D , 

a 4 Cj B D -192a 4 ([hcj) 3 } - [h cj) 2 ]^]) + 96a 4 ([h #</> 2 ] - [h^] 2 ) , (22) 

one immediately sees that either the first or the second pair of terms in eq. (|21[) (but not both) can be 
canceled by properly choosing a 3 and a 4 . The other pair gives a non-zero, four derivative contribution 
to the equation of motion for the fluctuation (p. The number of possible tensor structures is bigger than 
the freedom we have in the Lagrangian. This completes the proof that massive gravity around a generic 
background cannot have a purely two- derivative kinetic term for (p. 
In the A3 theory the local mass of the ghost goes as 

A 6 

-ghost(x) ~ nx) p nx) . (23) 

so that the ghost enters in the effective theory at a distance from the source much bigger than the 
Vainshtein radius, 

These results should be compared with their analogues in the A5 theory, eqs. (|12j) and (|13|), Again at 
the Vainshtein scale the mass of the ghost is of order of the inverse radius and we cannot proceed further 
towards the source. 

There is a final subtlety that needs to be addressed. Is the presence of a 4-derivative kinetic term 
enough to claim that there is a ghost? After all, the argument of sect, |2]strictly applies only to a Lorentz- 
invariant background. It is clear that if the derivatives acting on the fluctuation <p are contracted with 
a background tensor field different from r]^ v the situation can be very different. For instance, if the 
quadratic Lagrangian for ip involves 4 space derivatives but only 2 time derivatives there is no room 
for an independent propagating extra scalar (x, in the language of sect.EJ), and thus there is no ghost, 
provided that the (p 2 term has the healthy sign. Indeed, in most astrophysical situations macroscopic 
sources have non-relativistic velocities w<l; the background 4> b field they generate is therefore essentially 
constant in time, its time derivatives being suppressed by positive powers of v with respect to its spatial 
gradients. This, in a term like d^d u 4> d ll d p pd p d v p for instance, can suppress the magnitude of terms 
with 4 time derivatives. Although this is a parametric increase of the ghost mass, for typical astrophysical 
velocities v ~ 10 -4 — 10 -3 the regime of validity of the theory is not significantly widened: inside the 
Vainshtein radius the theory breaks down at r ~ Ry^v 4 / 5 ~ (10 -3 — 10~ 2 )Ry\ where the ghost mass 
becomes of order of the inverse radius. 



6 Inside brackets with <fi we indicate the matrix d^d^cj). For example the term [/i</>][0] 2 should be read as 
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6 Unitary gauge description 



In the previous section we argued in the Goldstone language that Fierz-Pauli massive gravity (together 
with all its infinitely many higher order extensions) is unavoidably plagued by ghosts around a tinily 
curved background. We now want to see how the sickness of the theory is interpreted in the unitary 
gauge. In order to take into account the presence of a (slightly) curved background we need to consider 
the field equations at second order in h^ v , 

hr, + bhr)r, + O(h%,)]=0, (25) 

where the quadratic terms come both from the mass term (that we take generic for the moment) and 
from additional higher order interactions. As it is well known, the invariance of the Einstein-Hilbert 
action under diffeomorphisms g^ v — > g^ u + V^ej, + V^e^, 

5S EH = 2jd*x 6 -^V,e u = -2 f d'x^g e v V, ( J_ ^ ) = , (26) 

implies the contracted Bianchi identities V^G^u = 0. As a consequence, from eq. (|25|) we get the four 
constraints 

V[aV+^V + 0(^)]=O, (27) 

which reduce to six the number of propagating components of /i™. The presence of these constraints 
is ensured by the gauge-invariance of the 'kinetic' action. This is in complete analogy to what happens 
in the theory of a massive vector particle, where the gauge invariance of the kinetic term implies the 
constraint d^A^ = 0, thus reducing to three the number of propagating degrees of freedom. 

If we now restrict to a linear analysis, for the particular Fierz-Pauli choice of the mass term (b = —a) 
we have an additional constraint equation. In fact the linearized Einstein tensor satisfies 

G^ = ^(V-H-) > (28) 

so that eq. (|27|) forces G^ 1 „ to vanish at linear order, and thus the trace of the Einstein equations eq. (|25jl 
becomes a constraint for the trace mode, 

h = . (29) 

In the end one is left with five propagating degrees of freedom, the correct number for a massive spin-2 
particle. However, it is clear that this last constraint is fundamentally different from the previous four, 
since, unlike them, is not ensured by any symmetry, but it is based on a precise tuning in the structure 
of the mass term and on the identity eq. (|28[). valid at linear order. It is then natural to expect that it 
does not survive in a curved background. The ghost we found in the Goldstone language is nothing but 
this hidden sixth mode that, although constrained in flat space, starts propagating around a non-zero 
background. Let us check that the energy scale at which the additional mode appears is indeed the same 
in the two descriptions. 

At quadratic order G^^ will not vanish anymore on the equations of motion, instead it will be of the 
form ~ d 2 h 2 . Therefore at quadratic order the constraint eq. (|29|) becomes a dynamical equation, 

0{d 2 hl v ) + m 2 g [h + 0(h^ 2 )] =0. (30) 

If we now write h^ u as the sum of the background field and fluctuations around it we see that the equation 
above describes the propagation of a mode with mass 

o 

7YI 

m 6«- m0 de( x ) ~ n^(x) ' (31) 
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where TC^u is the background metric. Notice that in the limit of zero background the mass goes to infinity 
and the mode decouples, as expected from the linear analysis 7 . 

In order to compare this with our results in the Goldstone language we must relate the background 
Tifxi/ix) in the unitary gauge to the Goldstone background <I> c (x). This is easily done by getting rid of 
the Goldstone, i.e. by performing a gauge transformation with parameter = ^rj^ d^ - Considering 
for definiteness the case of a spherical source, the background in unitary gauge is therefore the sum 
of the usual Schwarzschild solution of GR TL^ u (r) (corrected by the kinetic mixing with <£, hence the 
vDVZ discontinuity) and of the pure-gauge longitudinal contribution ^ij— 5 A1 9 i/ $ c (r). Both 7^, and 

jj^& c scale as Rs/r outside the Vainshtein radius, so that the pure-gauge contribution coming from the 
Goldstone is the dominant one in unitary gauge, 

H ^(r) ~ -4r^^ $c (^) ~ — » n U r ) ■ ( 32 ) 

nigMpi \ m g r ) r p 

This means that the mass of the 'sixth mode' eq. (|3Tj) is dominated, as expected, by the 3> background, 

„ m^Mpi A 5 5 

"V-mode^) ~ Q^-c ~ ffl^c • ( 33 ) 

which is exactly the mass of the ghost eq. (fT2|) we found in the Goldstone language! 



7 The sixth mode in the ADM formalism 

As we did in the Goldstone language we now show directly in unitary gauge that it is not possible to 
forbid the propagation of the sixth mode by adding properly tuned higher order terms to the action. 
We do this in the Hamiltonian formalism, where the counting of degrees of freedom is explicit. Let us 
introduce the ADM variables {N, Nj, g^} (TU], 

N = l/yfcjn , Nj = g 0j , (34) 

and gij is the 3D metric induced on spatial hypersurfaces of constant t. It is well known that in GR A" 
and Nj are not dynamical fields: the Einstein-Hilbert Lagrangian does not contain their time derivatives, 
so that their conjugate momenta vanish identically. Moreover in the Hamiltonian they appear linearly, 
as Lagrange multipliers. Therefore their equations of motion are really constraint equations for the other 
degrees of freedom g^ and their conjugate momenta tt ij , rather than equations for A" and Nj. These are 
the so-called momentum and Hamiltonian constraints. The Hamiltonian system is thus reduced to two 
independent (q,p) pairs, which describe the two graviton modes. 

If we now perturb GR by adding a mass term for h^u, or generic interactions involving only hp V and 
not its derivatives, the Lagrangian still does not contain time derivative of A" and Nj. However in general 
N and Nj now do not appear linearly in the Hamiltonian. In this case their equations of motion are now 
determining their value rather than constraining other degrees of freedom. This raises to six the number 
of d.o.f. of massive gravity. The Fierz-Pauli tuning of the mass term precisely corresponds to setting to 
zero the N 2 term in the action, so that (at quadratic order) the variation with respect to A^ still gives a 

7 A similar result has been obtained in ref. |18| . where it has been interpreted as an instability of arbitrarily 
short time-scale of Minkowski space. This conclusion is not justified from our effective theory point of view. 
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constraint equation, eliminating the unwanted 'sixth mode'. We want to see if similar tunings can work 
at all orders. 

Expressed in terms of the ADM variables the metric fluctuation = — r]^ y takes the form 

where hy = gij — 5ij, and g lJ is the inverse of c/ij. We are going to work perturbatively in 5N = N — 1, 
Nj and hij, so from now on spatial indices are contracted with the Euclidean 3D metric 5ij. Notice that, 
given the non-linear relation between h^ u and the ADM variables, a generic re-th order expression in h^ u 
also contributes to orders higher than n when expressed in ADM variables. 

Quadratic Terms. At quadratic order in hn V the most general Lagrangian is 8 

C 2 = a 2 [h 2 } + b 2 [h] 2 . (36) 

By plugging eq. Q35|) in this expression we find the term proportional to 5N 2 , 

£2 D 4(a 2 + 62) SN 2 , (37) 

hence the Fierz-Pauli tuning b 2 = —a 2 to set it to zero. The coefficient a 2 fixes the mass of the graviton, 
and for our purposes we can take it to be 1. At quadratic level (in ADM variables) the problem is solved, 
but C 2 also contributes to third and fourth order terms; in particular it contains a term —2ha 5N 2 , which 
upon variation with respect to SN gives rise to an equation for SN itself rather than to a constraint. We 
are therefore forced to introduce cubic terms in huv- 
Cubic Terms. The cubic Lagrangian is 

C 3 = a 3 [h 3 ] + b 3 [h][h 2 ]+c 3 [h} 3 . (38) 

Cubic terms in ADM variables come both from this and from C 2 . In particular, those involving more 
than one SN are 

£ 2 + £3 D (12c 3 + 4b 3 - 2)h u 5N 2 + 8(a 3 + b 3 + c 3 ) 5N 3 . (39) 
We can set both of them to zero by choosing 

a 3 = 2c 3 - \ , 63 = \ ~ 3c 3 ■ (40) 

This agrees with what we found in the Goldstone language, and again the coefficient c 3 is still undeter- 
mined. Now we are forced to introduce quartic terms in hn V to cancel undesired quartic terms containing 
5N n coming both from C 2 and C 3 . 

Quartic Terms. The quartic Lagrangian is 

£4 = a 4 [h 4 ] + b 4 [h][h 3 } + c 4 [h 2 } 2 + d 4 [h] 2 [h 2 } + e 4 [h} 4 . (41) 

Again, we are only interested in terms involving powers of 5N larger than 1. For symmetry reasons these 
must be of the form 

£2 + £3 + £4 D [A hi + B hijhij + C NjNj] 5N 2 + D h u SN 3 + E5N 4 . (42) 



8 In this section all contractions are done with the flat metric rj^ and there is no y/—g in the action. Different 
conventions are equivalent in unitary gauge: they just reshuffle the coefficients in the expansion in h^ v . 
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After a straightforward but lengthy computation we find the relationship between the coefficients (A, . . . ,E) 
and (c3,a 4 ,...,e 4 ), 
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/ o \ 

1/2 
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V o / 



(43) 



We would like to set the vector (A,...,E) to zero. Since we have 6 free coefficients (03, a 4 , . . . , e 4 ) 
to choose and only 5 conditions to satisfy, one naively expects this to be possible and one of the free 
coefficients to remain undetermined. On the contrary, it is impossible. This is because the matrix above 
has rank 4 and the space spanned by it does not contain the inhomogeneous term. There is no way of 
choosing (03, a 4 , . . . , e 4 ) to make the unwanted expression eq. (fi2l vanish. 

In summary, we tried to tune all interactions KL, in order to keep the Hamiltonian linear in iV (or 
equivalently in 8N), this to ensure the presence of a constraint equation that eliminates the troublesome 
sixth degree of freedom. We found that when fourth order terms are taken into account this tuning is 
impossible. This agrees with our result in the Goldstone language, namely that it is impossible to tune 
fourth order interactions to avoid the propagation of a ghost. 

Notice however that the ADM analysis we sketched in this section is useful to explicitly count the 
number of degrees of freedom but, unlike the Goldstone approach, gives us no clue on the typical mass of 
these modes. From the effective field theory point of view this additional information is crucial: a massive 
mode with a mass above the cutoff can be consistently discarded, even if it is a ghost or a tachyon. Also 
we have not checked in this language that the sixth mode is a ghost. To do so one should compute the 
Hamiltonian and see that it is not positive definite. This approach is rather cumbersome and it has been 
carried out in ref. |2j only for a limited set of non-linear terms, namely functions of the Fierz-Pauli mass 
term: f(h%, - h 2 ). 



8 Concluding remarks 

It this paper we have shown that in massive gravity the classical solutions around a source are plagued 
by ghost instabilities. This holds for any choice of the non-linear terms one can add to the Fierz-Pauli 
action. It is known that massive gravity is an effective field theory whose UV cut-off can be pushed at 
most up to A3 = (m^Mp) 1 / 3 . In this optimal case the ghost instability enters in the effective field theory 
at a distance from the source -Rghost ~ I/A3 • (M^/Mp) 1 / 2 . This distance is huge, much bigger than the 
Vainshtein radius Ry ~ I/A3 • (M^/Mp) 1 / 3 . For instance taking m g ~ Hq, for an astrophysical source 
like the Sun i? g host ~ 10 22 km, of order of the cosmological horizon! One could optimistically postulate 
that new physics enters at energies of order of the (local) ghost mass and cures the instability; even under 
this hypothesis, when the mass of the ghost becomes of order of the inverse distance from the source 
there is no way of proceeding further without specifying the UV completion of the theory. This happens 
at the Vainshtein radius (of order 10 16 km for the Sun); as the vDVZ discontinuity can be cured only 
inside the Vainshtein radius, General Relativity is never a good approximation. 

One could argue that the very low cut-off of massive gravity is already bad enough to disregard the 
theory. For instance, if one assumes that the theory has a generic series of higher dimension operators 
suppressed by the cut-off, these will become important when evaluated on a classical solution at huge 
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length scales, parametrically bigger than the inverse cut-off 0. This implies that it is impossible to 
calculate the classical solution around a source without specifying the UV completion. However this 
problem depends on the high energy completion of the theory, and its precise formulation is subtler than 
what one can argue at first sight. For example in the DGP model, which in this respect is very similar 
to massive gravity, one can make consistent assumptions on the higher dimension terms which make 
predictions independent of the UV completion ^3]. Of course a prerequisite for this to work is that 
the classical theory is free from pathologies and instabilities. As we have shown this is not the case in 
massive gravity: ghost instabilities are unavoidable. The theory is inconsistent already at the classical 
level, before taking into account quantum effects. 
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